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Abstract 

We discuss here a function that is part of the folklore around the P —?NP 
problem. This function g* is defined over all time-polynomial Turing 
machines; we trivially change it into a g which is defined over all Turing 
machines, and show that g overtakes all total recursive functions. We 
then show that g is dominated by a generalized Busy Beaver function. 
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1 Introduction 



Mathematics is sometimes full of unexpected turns, like a good mystery novel. 
The Busy Beaver game looks naive enough, and yet it leads us to an incredibly 
fast-growing, uncomputable function [fL2| . Also, who would expect that ques- 
tions about finite functions would depend on the existence of large cardinals? 

Sometimes mathematics behaves in a rather predictable, reasonable way. 
When one goes from real numbers to complex numbers to quaternions and then 
to the Cayley numbers, each step in that succession is marked (as it should 
be) by a slight modification of previously known properties. Yet in many cases 
the generalization of properties leads us into the unexpected: a contemporary 
example can be found in the mathematics of gauge fields. Given an Abelian 
gauge field, that field is always (locally, at least) derived from a gauge potential 
(a connection form) which is unique modulo gauge transformations. However 
when we deal with non-Abelian gauge fields, we see that the same field can be 
derived from physically different gauge potentials, that is, potentials that aren't 
related by a gauge transformation, not even locally j| ||, ■ Just by looking at 
the Abelian case we would never expect such a novel phenomenon to creep up in 
the non-Abelian case. Moreover this precise phenomenon is quite surprisingly 
related to a kind of transformation first used by Einstein in his theory of the 
asymmetric field [|[ ||. 

Can we say that we are dealing here with singularities in a kind of space 
of mathematical theories? We have in fact recently proposed this picture: we 
described a coding of formal systems by dynamical systems that bifurcate: for 
instance, if a given (undecidable) sentence holds, the system bifurcates; if not, it 
doesn't. Thus undecidable sentences are coded by singularities in those systems 
§• 

(However we are not sure whether the formal picture that opposes singular 
points to regular points is an apt characterization of our intuitions about the 
unexpected in mathematics; the picture may be still more complex.) 

The P =?NP question 

Let's consider a final example, which will lead us into this paper's main topic. 
That example stems from the P —1NP question. 

The P =?NP question starts from a very simple, rather obvious question 
that we can formulate as a short tale: 

Mrs. H. is a gentle and able lady who has long been the secretary of 
a large university department. Every semester Mrs. H. is confronted 
with the following problem: there are courses to be taught, profes- 
sors to be distributed among different classes of students, large and 
small classes, and a shortage of classrooms. She fixes a minimum ac- 
ceptable level of overlap among classes and students and sets down in 
a tentative way to get the best possible schedule given that minimum 
desired overlap. It's a tiresome task, and in most cases, when there 
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are many new professors or when the dean changes the classroom 
allocation system, Mrs. H. quite frequently has to check nearly all 
conceivable schedulings before she is able to reach a conclusion. In 
despair she asks a professor whom she knows has a degree in math: 
"tell me, can't you find in your math a fast way of scheduling our 
classes with a minimum level of overlap among them?" 

Mrs. H. unknowingly asks about the P =1NP question. She is able to 
understand its basic contours: there are questions for which it is easy to check 
whether a given arrangement of data fits in as a solution; however for the general 
case there are no known shortcuts in order to reach a solution. 

If you can guess the answer, it is easy to verify it. If you have to work it out, 
you'll most likely have to sweat your shirt until you reach an adequate answer. 
Most likely: for one always wonders whether there is some general way to get a 
quick solution. Who knows? 

Are there general shortcuts available? This is the whole point — in a nutshell; 
and this is the main query in the P =?NP problem. 

P = NP is, roughly, "there is a time-polynomial Turing machine that inputs 
each instance of a problem in a NP class and that correctly guesses (outputs) 
a solution for that instance." The negation P < NP is, "for each polynomial 
machine of Godel number m there is an instance x such that machine m guesses 
wrongly about x." 

The counterexample function /(m) is the function that enumerates each first 
instance x where a polynomial algorithm fails; it is recursive on the set of all 
polynomial machines and P < NP holds if and only if / is total. 

So, one of the possible approaches to this major problem is to focus on the 
counterexample function /. How are we to understand its properties? Perhaps 
by analogy: we use a tamer function g that more or less looks like /, and try to 
infer the properties of / from those of g. The chief property we want to consider 
is whether / can be proved to be total in Peano Arithmetic (PA), for in that 
case: 

If the counterexample function / is bounded by a prescribed total 
recursive function — for Peano Arithmetic, the fast-growing function 
F £o — then P < NP will be provable in Peano Arithmetic. 

For an adequately conceived g, it should be easy to check the desired property 
for g, and it should now be just a small step from g to /. 

(For F eo see [[l5) and references therein; for the counterexample function see 

0-; 

Then our main query is: is g bounded by such a function F eo ? If not, where 
do the properties of g will lead us concerning / ? 

Functions g and g* 

Remark 1.1 Let's motivate g* . Consider the following: 

go(m) = ,x x (x m < T) (1) 
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for non-negative integers m. This function is trivially recursive and total, and 
bounded by the exponential 2 X itself. 

Now suppose that we have defined some (necessarily nonrecursive) enumer- 
ation for all time-polynomial Turing machines (the poly machines). This enu- 
meration is noted, P m ,m £ u; we note V the set of all polymachines. The 
following function: 

g*{m)=n x [P m (x) <2 X ] (2) 

looks very similar to ([!]). In fact we have only changed the monomial x m for 
the output P m (x) of a poly machine, and that output is necessarily bounded in 
its length by some polynomial, that is, there is a positive integer k so that 

\Pm(x)\ < \x\ k . 

Therefore g* is also total. 

g* is usually presented as a kind of analogue of go in eq. (0) , and it is argued 
that, since go is bounded by an exponential, the same should be true of g* in 
cq. (||). Yes, a moment's thought shows that this should be the case. . . 

Wait a moment — is it really so? 

Let's ponder it: the relation between exponent k and machine index m is 
possibly quite complicated — and index m roams over a nonrecursive set. 
The analogy starts to break down here. 

The analogy breaks down right at the beginning. Can we pass over those 
differences and still have a rather "tame" behavior for g* as the one exhibited 
by go? n 

We show in this paper how wildly different is the behavior of g* when com- 
pared to go, despite the fact that the two functions are apparently very similar. 
The whole point, as we will see, is the relation between machine index m and 
the exponent k for the bounding polynomial. 

Analogies between / and g* 

The main analogy between the counterexample function / and g* is: both are 
defined over the set of all poly machines V and their values are given by the 
application of the \i operator to a recursive predicate on V that depends on the 
index m of a poly machine P m (for the case of / see Of course the main 
difference between both functions is that g* is intuitively total, while the whole 
point of the P =?NP question is whether / is total or not. 

Summary of the paper 

g* is part of the folklore around the P =1NP question. As noticed, it is some- 
times used as a simile for the behavior of the counterexample function to the 
P = NP (no question-mark here!) hypothesis. We consider its behavior in 
detail in the present paper in order to show how wildly different is the behavior 
of g* or its modification g (see below) when compared to that of go- 
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This paper shows that this function g* isn't well-behaved at all. In fact, 
given a natural version of it defined over all Turing machines, the modified g* 
(noted g) will oscillate in a wild way, and in its "ups" it will tower over all total 
recursive functions, so that the only immediate bounds we can find for it are in 
the realm of those functions which bound all total recursive functions. 

One of those upper bounds is a generalized Busy Beaver function, as we 
show in Section ^ of this paper. 

Preliminary comments 

Remark 1.2 If M. is the set of all Turing machines given by their Godel num- 
bers, the subset of all poly machines, V C Ai, isn't recursive. 
If we consider a function g given by: 

• g(m) — fi x [M m (x) < 2 X ], for m in V, 

• g{m) = 0, otherwise, 

we intuitively see that g is total. O 

We consider g instead of g* as it is clear that if there is some total recursive 
f that bounds <?, then we can obtain an adequately modified h* that will bound 
9*- 

Goals 

Our goal in the present paper is to discuss some properties of g. We will try 
to understand its relation to the set of all total recursive functions; everything 
proceeds within in an intuitive framework, where we will be able to compare it 
to a generalization of the Busy Beaver function . 

As noticed above, functions g* and g are part of the folklore around the P = 
1NP question; g* was suggested to the authors by F. Cucker and, independently, 
by W. Mitchell. 

2 Conventions on Turing and ^—machines 

We will use here what we call ^-machines, for "labeled" or "parametric" Turing 
machines; they are defined in this section. They are defined out of the usual 
Turing machines, which we now describe. They are simply Turing machines of 
which we keep track with the help of a tag given by a parameter. 

Remark 2.1 Suppose given the canonical enumeration of binary words 

0,0,1,00,01,10,11,000,001,... 

which code the empty word and the integers; they correspond to 

0,1,2,.... 

We take this correspondence to be fixed for the rest of this paper. □ 
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Turing machines 

Remark 2.2 We describe the behavior of the Turing machines we deal here to 
avoid ambiguities: 

1. Turing machines are defined over the set A\ of finite words on the binary 
alphabet A 2 — {0, f }. 

2. Each machine has n > states so, si, . . . , s„_i, where so is the final state. 
(The machine stops when it moves to so.) 

We allow for a machine with states and an empty table; it is discussed 



below in Remark 2.6 



3. The machine's head roams over a two-sided infinite tape. 

4. Machines input a single binary word and either never stop or stop when 
the tape has a finite, and possibly empty set of binary words on it. 

5. The machine's output word will be the one over which the head rests if 
and when so is reached. (If the head lies on a blank square, then we take 
the output word to be the empty word, that is, 0.) d 



Remark 2.3 The Turing machine inputs a binary string [x\ and (if it stops 
over L^J ) outputs a binary string [y\ . The corresponding recursive function 
inputs the numeral x and outputs the numeral y. Whenever it is clear from 
context, we write x for both the binary sequence and the numeral. □ 



Remark 2.4 We will use upper case and lower case sans serif letters (such 
as M,...) for Turing machines (and also for ^-machines). If M„ is a Turing 
machine of Godel number n, its input- output relation is noted M n (x) = y. 



Remark 2.5 Turing machines are given by tables. We can write the tables as 
code lines separated by blanks U, such as £ U £' U . . . U 

Let 3 be one such set of code lines separated by blanks. Let 3' be obtained 

out of 3 by a permutation of the lines £, Both 3 and 5' are seen as 

different machines that compute the same algorithmic function, that is, in this 
case, if fs (/=') is computed by 3 (5'), then fs = fs'- ^ 



Remark 2.6 We define the empty or trivial machine to be the Turing machine 
with an empty table; we take it to be the si mples t example of the identity 
machine, again by definition. (See also Remark 2.16.) □ 
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^—machines 

Remark 2.7 We describe here the ^-machines. Consider a two-tape Turing 
machine Q where the input is written over tape 1, while tape 2 comes with a 
possibly empty binary string n. n is the machine's label, or parameter. 

• One easily sees that each Turing machine M m is simulated by infinitely 
many ^-machines: write an arbitrary n on tape 2 and write all instructions 
for M m solely for tape 1. 

• For the converse, if M m is an arbitrary Turing machine, and if (by an abuse 
of notation) r is the machine that polynomially computes the onto pairing 
function wxw^w, then the set of all coupled machines M TO o r(n, x), all 
n, m, represents the set of ^-machines. 

• Moreover, to avoid a pre-fixed label or parameter, we can use the family 
of constant Turing machines i„ that print n over any input, and couple it 
to the arrangement above, that is M m o t o i„ (in order to generate n), to 
obtain a set of emulated ^-machines. 

The Godel number k — c(m, #r, n) (# . . . denotes the Godel number of . . .) 
is given by a primitive recursive c fllf . d 

^From that we have: 

Definition 2.8 An (-machine is a pair (n, M m ), where n £ lo and M m is a 

2-tape Turing machine of Godel number m, with n written on tape 2. □ 



Remark 2.9 ^From now on whenever we write "machine" we will mean "£- 
machine," unless otherwise stated, d 



Remark 2.10 So, ^-machines are just a convenient bookkeeping device to keep 
track of families of Turing machines which are labeled by some parameter. How- 
ever, under the guise of ^-machines they allow us the simple result stated in 
Lemma 2.13. □ 



Godel numbering 

(We use here Kleene's "method of digits," without however "closing the gaps" 
to squeeze out the junk; see 0, p. 178.) 

Definition 2.11 Each (.-machine is coded by the pair (n, m), where both n,m 
range over the whole ofoj. d 



Remark 2.12 We refer to (n,m) as the Godel number of the (-machine, or 
(-Godel number, or (-index. In order to compute some values for g as we do 
here one needs to know the £-index for some poly machines which are given as 
^"-machines, d 
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A lemma on Godel numbers 

In order to compute pieces of g one needs to know the Godel numbers for some 
polynomial Turing machines. We use the following result: 

Lemma 2.13 Let M^ n m ^, new,™ fixed, be a family of (.-machines. We 
can explicitly construct an (-Godel numbering for the set of all (-machines 
so that the (-indexes for the M^ nTn ^, n £ u>, are given by a linear function 
N(n) = an + b. 

Moreover there is a primitive recursive map from that (-Godel numbering to 
standard Godel numberings. Therefore, ifk is such a map, the induced N'(n) = 
k o N(n) is primitive recursive. 



Remark 2.14 Recall that the length \x\ of a finite string x is the number of 
letters in x. □ 



Proof of Lemma 2.1c : The idea goes as follows: 1547, 2547, 3547, . . . are in 
arithmetic progression with r = 1000 = 10 3 = io lcn s th ( 547 ). We extend this to 
the proof of our lemma. 



Recall the ordering of binary words as described in Remark 2.1 



• ^-machines are fully characterized by pairs (n,m), where n is the param- 
eter and m the 2-tape Turing machine's code. 

• If x n and the corresponding binary words (numerals), code pair 
(n, m) as the word XnlQx^n, where if 

-.. — ™0 1 2 h 



then 



• So, given an arbitrary binary word, there is a simple recursive procedure 
to decide whether it codes an ^-machine (n, m) or not: go to the rightmost 
end and (by going backwards) see if we have a duplicated word that ends 
in the sequence 10. If so, we have a coded binary word for one of our 
pairs. 

• Map the in-between mumbo-jumbo onto the trivial ^-machine. 

• This coding is certainly recursive and 1-1 onto the natural numbers. 

• Finally, if \x m \ is the length of x m , then = 2\x m \ + 2, and therefore, 
between l?-machine (n,m) and ^-machine (n + l,m) there are 2 2 ^ Xm ^ +2 
binary words. 
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Follows the lemma, and the Godel number N(n) of the ^-machines just 
described is given by an arithmetic progression N(n), of ratio 2 2 ^ Xm ^ +2 . 



^From Remark 2.7 one immediately sees that there are primitive recursive 
maps that lead from that coding to more usual Godel numberings, such as the 
original one used by Godel, or Kleene's "method of digits," so that N'(n) is also 
primitive recursive, d 

Remark 2.15 So, it is enough to know that the Godel numbers N for the 
families of machines we will be using here are related to the machine's "tag" or 
parameter by N = q(n), where q is primitive recursive, that is, Ackermann's 
function F w dominates q. D 

Quasi— trivial machines 

We use ^-machines; therefore here "machine" stands for ^-machine. 

Remark 2.16 Again the trivial ^-machine is the one with the empty table; 
more formally, ^-machine (0,0). D 

We will now consider a set of, say, nearly trivial machines. They are all 
polynomial. Recall that the operation time of a Turing machine is given as 
follows: if M stops over an input x, then the operation time over x, 

tM = \x\ + number of cycles of the machine until it stops. 



Example 2.17 

• First nearly trivial machine. Note it 0. inputs x and stops. 

to = \x\ + moves to halting state + stops. 

So, operation time of has a linear bound. 

• Second nearly trivial machine. Call it 0'. It inputs x, always outputs 
(zero) and stops. 

Again operation time of 0' has a linear bound. 

• Quasi— trivial machines. A quasi-trivial machine Q operates as follows: 
for x < xo, xq a constant value, Q = R, R an arbitrary total machine. For 
x > x , Q — or 0'. 

This machine has also a linear bound. D 

We will use several ^-families of quasi-trivial machines. Please allow for 
some abuse of language here; we need it in order to avoid cumbersome notations. 
(For instance, the machines defined below depend on the Godel numbers of the 
machines that appear as their subroutines.) 
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Remark 2.18 Now let H be any fast-growing, superexponential total machine. 
Let H' be another such machine. Form the following family Q H ( ra ) of quasi-trivial 
^-machines with subroutines H and H': 

1. H(n) = k(n), all n, is the way we introduce the parameter in the family. 

2. If x < k(n), Q H > H >(a;) = H'(a;); 

3. If x > k(n), Q H > H >(» = 0. □ 



For function g in Remark 1.2 



Proposition 2.19 If N(n) is the Godel number of an i-machine as in Remark 



2.18, then g(N(n)) = k{n) + 1 = H(n) + 1. 



Proof: Use Lemma 2.13. Recall that N(n) = an + b, a and b constants that 
depend on the actual algorithms (i.e., in the Godel numbers) for H, H' and Q. 
□ 



3 A domination lemma 

Recall: 

Definition 3.1 For /, g : co — > lo, 

f dominates g ^Dcf 3y Vx (x > y — > f(x) > g(x)). 
We write f >- g for f dominates g. D 

Our goal here is to prove the following result: 
Proposition 3.2 For no total recursive function h does h >- g. 

Proof: Suppose that there is a total recursive function h such that h y g. 

Remark 3.3 Given such a function h, obtain another total recursive function 
h' which satisfies: 

1. h' is strictly increasing. 

2. For n > no, h'(n) > h(?n — for integer values of the argument of g. 
That is, W{an + b) > h(n). □ 

Constants a, b are from the Godel numbers of the quasi-trivial machines 



described in Remark 2.18 



Lemma 3.4 Given a total recursive h, there is a total recursive W that satisfies 



the conditions in Remark 3.S 
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Proof: Given h, obtain out of that total recursive function a strictly in- 
creasing total recursive h*. Then if, for instance, is Ackermann's function, 
h' = h* o F" will do. □ 



\h',K.n 



^From Lemma B. 13 , we have that the Godel numbers #[Q ' '"] of the Q 
are given by #[Q h '' K ' n ] — an + b, a, b € w. 

Therefore, g(an + b) — h'(n) + 1. From Lemma [2.13 , Remark |3.3| and 
Lemma 3.4 we conclude our argument. If we make explicit the computations, 
for q(n) — an + b (as the argument holds for any strictly increasing primitive 
recursive q): 

ff (q(n)) = h , (n) + l = h*(F ( ,(n)) + l, 



and 



h*(MrO) > h*(q(n)). 



For TV = q(n) (see Remark 2.15), 

g(N) > h*(N) > h(TV), all TV. 

Therefore no such h can dominate g. d 

Remark 3.5 No need to emphasize that we could have used ordinary Turing 
machines, as at the end of Remark [2.7| ; then N(n) would be primitive recursive 
on n. Q 



4 An extended Busy Beaver function 

So g overtakes all total recursive functions. Can we put a ceiling to it? Yes: 
and that ceiling is a kind of generalized Busy Beaver function (which depends 
on g itself). 

We now sketch an argument that shows that this generalized Busy Beaver 
function dominates g. 

The Busy Beaver function B can be easily and intuitively defined. If TV is the 
number of states of a given Turing machine, then B(N) is the biggest number 
of Is a TV-state Turing machine prints over a tape filled with Os as its input. 

Remark 4.1 We are going to generalize that function to B' as follows: 

• First, notice that there is a simple relation between a Turing machine's 
table and the number |^ of its states. 

More precisely, from that reference we see that a TV-state Turing machine 
with a binary alphabet has a table given by a TV' = (3TV + 1) x 4 matrix. 

So we can define a primitive recursive relation f from the Godel numbers 
to the set of states of Turing machines so that if Godel numbers m! > m, 
then states f(m') > f(m). 
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• We can therefore see the Busy Beaver function B as a monotonic nondc- 
creasing function of Turing machine Godel numbers. 

• Now for each Godel number m for a polynomial Turing machine, g(m) is 
effectively computed. 

• We will define B'(m) as: the maximum number of Is printed by a N(m)- 
statc Turing machine over a tape with only Os and over a tape with all 
inputs < g(m) + 1. 

• Clearly B'(N) > B(N) and B'(N(m)) = B(N(m)) if the Turing machine 
with Godel number m isn't a polynomial machine, d 

Proposition 4.2 For all m, B'(m) > B(m). □ 

Proposition 4.3 B' dominates g. □ 

Remark 4.4 We conjecture that g is of the same order of growth as the Busy 
Beaver function itself. □ 

We intend to apply those ideas to the counterexample function / in a follow - 
up paper. 
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